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Abstract. Voids are a prominent feature of fractal point distributions but there is no precise definition of 
what is a void (except in one dimension). Here we propose a definition of voids that uses methods of discrete 

• stochastic geometry, in particular, Delaunay and Voronoi tessellations, and we construct a new algorithm 

' 

, to search for voids in a point set. We find and rank-order the voids of suitable examples of fractal point 

sets in one and two dimensions to test whether Zipf's power-law holds. We conclude affirmatively and 

in ' 

furthermore, that the rank-ordering of voids conveys similar information to the number-radius function, as 



oo 

o 

O 
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regards the scaling regime and the transition to homogeneity. So it is an alternative tool in the analysis of 



fractal point distributions with crossover to homogeneity and, in particular, of the distribution of galaxies. 

CO 



PACS. 05.45.Df Fractals - 02.50.-r Probability theory, stochastic processes, and statistics - 98.65.Dx 
Galaxy groups, clusters, and superclusters; large scale structure of the Universe 

1 Introduction well; namely, the number of gaps of length U greater than 

u is N(U > u) (x u~ D , where D is the fractal dimension. 



As is well known, self-similar fractal sets can be character- 
ized by a power-law number-radius function (variation of 
the number of points in a cluster with its radius). Accord- 
ingly, tests for fractality are made by measuring this func- 
tion and fitting it to a power law. On the other hand, Man- 
delbrot in his seminal work on fractals [1] introduced the 
concept of fractal holes, under the greek word tremas, and 

studied their distribution, showing that the distribution of Apart from Mandelbrot's general ideas, fractal holes 
one-dimensional holes (gaps) follows a simple power law as have not been a special object of study except in Cos- 



He also considered the higher dimensional generalization 
of this distribution for those fractals with mathematically 
well-defined holes, such as fractal carpets, sponges, foams, 
etc. These fractal sets have topological codimension one, so 
they are formed by curves in two dimensions or surfaces 
in three dimensions (etc.) which enclose empty holes. 
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mology 2 . The presence of large voids in the distribution that they should be merged into a unique void of irregular 

of galaxies is known to cosmologists since the late 70's, shape. This suggests to improve the definition of void by 

although their importance has only been recognized re- considering voids of arbitrary shape, in such a way that 

ccntly. The distribution and properties of these voids are fractal voids still fulfill the Zipf law. 

now a subject of systematic study [3>4:,5,6]. Mandelbrot Regarding self-similar fractals in nature, we must take 

considered tremas in the distribution of galaxies but, ac- into account that scale invariance has a limited range, 

cording to the observational situation at the time, favored between a lower and an upper cutoff. For scales above 

models with small voids and, actually, introduced the con- the upper cutoff the number-radius function N(r) oc r D 

cept of lacunarity to account for this feature. At any rate, a crosses over from D < d to homogeneity D = d [d is 

fractal model of the distribution of galaxies must have zero the embedding space dimension) . In cosmology very large 

topological dimension, so the concept of hole is not math- scale homogeneity is the basis of the standard Friedmann- 

ematically well defined. In fact, Mandelbrot's description Robertson- Walker model, but the transition to homogene- 

of the distribution of galaxies as having low lacunarity ity is controversial [9] . 

was based on visual impression. The procedure to define So the purpose of this work is: (i) to provide a def- 
voids in current cosmological studies is algorithmic: given inition of voids in fractal sets of points (zero topological 
a galaxy catalog, one designs an algorithm to find large dimension) with transition to homogeneity; namely, to de- 
empty regions sequentially (a void-finder) . Different void- vise a general algorithm to find voids in fractal sets of 
finders provide different lists of voids. Presumably, to have points in d dimensions showing a scaling (Zipf's law) that 
sound statistics of voids in the galaxy distribution or, gen- matches the scaling of voids in fractals in one dimension; 
erally, in abstract fractal sets one needs an adequate defi- (ii) then, to compare the Zipf law for the rank-ordering 
nition of void. of voids with the standard calculation of the fractal di- 
mension from the number-radius function, (iii) finally, to 

We proposed in Ref. [2] to apply to voids in the distri- 

indicate how to apply our conclusions to the voids in the 

bution of galaxies rank-ordering techniques [2l[8] . A power- 
galaxy distribution. 

law rank order is Zipf's law [7] and is equivalent to the 
law N(U > u) (x u~ D . We defined voids in a particularly 

2 Scaling of voids 

simple manner, namely, as having constant shape (circles, 

squares, etc.) and showed that Zipf's law is fulfilled by It is easy to reach conclusions on the scaling of voids in 

fractal voids independently of the chosen shape. However, deterministic Cantor-like fractals constructed by a recur- 

this prescription for voids is far from satisfactory: often sive procedure, especially in one dimension [2J. The frac- 

voids of similar size touch each other and visually it seems tal generator is characterized by three independent num- 
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bers: r, N, m. The first number, r < 1, is the scaling fac- sion, which is the dimension of the embeddding Euclidean 

tor: a rf-dimensional unit cube is divided into l/r d scaled space (d) minus the topological dimension, 
cubes of size r d . One keeps N of the resulting l/r d cubes In the construct ion of Cantor-like fractals, let us as- 

to repeat the process and removes the others, leaving m sume ^at every removed trema gives rise to a unique void 

tremas, with ranks from R x = 1 to m and average size of thc limit set) which is truc if the generator does not in- 



Ai = (1 - r d N)/m. Of course, 1 < m < l/r d - N; the 



elude any trema at the boundary so tremas removed in 



maximum number of tremas m = l/r d - N is reached different iterations cannot merge. This condition is very 

when they are elementary cubes with no common facet. natural in one dimension (merging in one dimension is 

In the second iteration, there will be mN tremas with Ynmted, anyway ). i n d > 1 it ensures that the fractal 

ranks from R 2 = m + 1 to m + mN and average size get nas topological codimension one, because it contains 

^2 = r d (l - r d N)/m, etc. The fractal dimension D = thc bound aries of tremas which have themselves codimcn- 

— log N/ log r is independent of m. In the fc-th iteration gion Qne Typical fractals generated in this form are the 

(fc > 1), R k cx N*- 1 and A k oc (r d ) k -\ so thc function Sierpinski carpets [J. Therefore, by identifying removed 

A(R) is a power law with average slope -d/D. tremas with yoidg of the Umit ^ we have the zipf law 



A fractal set is partly characterized by its fractal di- 
mension. Dimension is a central concept in fractal geom- 
etry but it admits several definitions pQ. Most impor- 
tant are the topological dimension and the Hausdorff- 
Besicovitch dimension. Indeed, a fractal is conventionally 
defined as a set for which the former dimension is strictly If the fractal set does not have topological codimen 



A(R) oc R~ d / D . We remark that this law holds on average, 
possibly with log-periodic corrections: within every period 
the size of voids can be constant or change in an almost 
arbitrary way. Log-periodic corrections are the hallmark 
of discrete scale invariance and naturally appear in Cantor 
like fractals [5]. 



smaller than the latter (which is simply called fractal di- sion one and > in particular, if it has topological dimension 

mension). The fractal dimension of a self-similar fractal zero ( in d > *)> the merging of generator tremas at dif- 

can be calculated from the number-radius relation. The fercnt levels leads to the complementary of the fractal set 

topological dimension of a set is a topological invariant bein S connected. This demands a criterium for separation 

that can be determined from the properties of its covering °^ v °i ds - 

by small open sets [10] . A set of topological dimension zero We must remark that the last number defining the gen- 
is what we intuitively call a set of points, a set of topolog- erator, namely, the number of tremas m, has no effect on 
ical dimension one is what we intuitively call a curve (or dimension but characterizes the morphology of the fractal, 
set of curves), etc. Here we use the topological codimen- For given r and N, and therefore given fractal dimension, 
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the largest m = l/r d — N corresponds to the smallest centers of the circumscribing spheres. Each Voronoi cell is 
tremas that the generator can have, and viceversa. There- the neighbourhood of a point of the set, in the sense that 
fore, m is a measure of the size of tremas, that is, of lacu- the points of the interior of the cell are closer to that point 
narity. We recall that the notion of lacunarity (from the than to any other point of the set. 
latin word "lacuna" , meaning lake) was defined by Man- 
delbrot pQ as an intuitive measure of the size of tremas: 

The Delaunay and Voronoi tessellations are fundamcn- 

fractals with larger tremas (and the same dimension) are 

tal constructions associated to a point set. In the search 

more "lacunar" . It is easy to prove that the multiplicative 

for voids, the defining property of the Delaunay tessella- 

factor in the Zipf law A(R) cx R~ d / D is related with m (in 

tion is obviously adequate: we can consider each Delaunay 

addition to r and N) [2J. In the log- log plot of the Zipf 

simplex as an elementary void and define voids by join- 
law, the multiplicative factor becomes an additive factor 

ing adjacent simplices according to some criterium. The 

so it only affects the overall position of the plot along the 

natural criterium is given by the overlap of circumscribing 

ordinate axis. This effect of lacunarity is trivial. However, 

spheres corresponding to adjacent simplices being above a 

there is no unique definition of lacunarity. For example, a 

predetermined threshold. This overlap criterium has been 

more elaborate definition of lacunarity relates it with the 

employed in various manners in other void-finders that use 

three-point correlation function [llj . In fact, higher order 

spheres 4j5]|[6]. A simple and efficient way to implement 

correlations contain morphological information that goes 

it is to demand that the distance between the centers of 

beyond lacunarity, including information on the shape of 

two overlapping spheres be less than a given fraction / of 

voids. 

the smaller radius (see Fig.[T]). This criterium generalizes 
the similar criterium in Ref. [5] by adding the fraction / 

3 Discrete geometry methods and void-finder 

as a free parameter (set to one in that reference) . The ad- 
To design a robust void finding algorithm, it is natural vantage of having a free parameter is that one has some 
to rely on concepts of discrete stochastic geometry (in- control on the shape of the voids: for / <C 1 we just have 
troduced in cosmology with different objectives by Rien the elementary voids, that is, the simplices of the Delau- 
van de Weygaert and collaborators [H]). The Delaunay nay tessellation, but for / ~ 1 the voids adopt a rami- 
tessellation of a <i-dimensional point set consists of a set fied shape and the largest void may percolate through the 
of links between the points, forming simplices such that sample. When this happens, the percolating void takes 
their respective circumscribing <i-spheres do not contain too large a fraction of the total size and imbalances the 
any other point of the set. This tessellation is unique. The statistics. Actually, the imbalance effect takes place before 
Voronoi tessellation is the dual tessellation formed by the percolation and it is best to keep / sufficiently small. 
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2. Sort the triangles of the Delaunay triangulation by size 
and select the largest one to begin to build the first 
void. 

3. Grow the void by adding adjacent triangles (in this 
process, the Voronoi tessellation is useful). A triangle 
is added if the overlap criterium is met (for the chosen 
/). The set of all triangles found constitutes the void. 

4. Iterate by finding the largest triangle among the re- 
maining ones until they are exhausted. 



Fig. 1. Test set of 16 points showing the relevant discrete ge- 
ometry constructions: the Delaunay triangulation, the Voronoi 
cells of interior points, and the circles circumscribed to two 
adjacent triangles, with the segment linking their centers in 
boldface and the radius of the smaller circle in dashed bold- 
face. Our overlap parameter / is a bound to the ratio of the 
length of the segment linking their centers to the length of the 
radius of the smaller circle. 

We have devised the algorithm for two-dimensional 
point sets but the generalization to three (or more) di- 
mensions is straightforward. To begin the search for voids 
we need to estimate where we may find the largest one, 
but we cannot measure the size of a void until it has been 
found; so we look instead for the largest Delaunay triangle. 
The algorithm consists of the following steps: 



1. Construct the Delaunay triangulation and Voronoi tes- 
sellation for the given point set. 



4 Rank-ordering of voids and Zipf's law 

Before applying our void-finder to fractals with transition 
to homogeneity, let us consider the effect of this transition 
on the Zipf law. A transition to homogeneity in a one- 
dimensional random Cantor-like fractal can be achieved 
by joining by the ends several realizations of it. For the 
sake of the argument, let us assume that a single real- 
ization follows a perfect Zipf law An = A\R~^ . Then we 
have in 10 copies of the fractal, say, 10 voids with size 
A\ and ranks R — 1, . . . , 10, 10 voids with size Ai and 
ranks R — 11,..., 20, etc. So the sizes follow the law 
logylion = log ill + Cl°gl0 — Cl°g(10 n )j A being con- 
stant between ranks lOn — 9 and lOrt. This is a stepcase 
with steps of exponentially decreasing width and linearly 
descending ends with slope £. Relaxing the condition of an 
initial perfect Zipf law, the steps become smooth, so we 
conclude that the effect of the transition to homogeneity 
is the flattening of the Zipf law for small ranks and that 
the width of the flattened portion measures the scale of 
homogeneity. 
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Fig. 2. (Top) number-radius function for a D — 1/2 fractal 
set in d = 1 (logarithms are to base v2 and the total size is 
normalized to unity), and (bottom) Zipf's law for the rank- 
ordering of its voids (gaps). 

A coarse-graining procedure leads to a different picture 
of the transition to homogeneity, which can be considered 
a bottom-up picture, opposed to the preceding top-down 
picture. A fractal is invariant under coarse-graining while 
in the scaling range, but eventually the coarse-grained par- 
ticles are no longer correlated and correspond to a homo- 
geneous distribution. Hence, we deduce that the largest 
voids are the ones between the random (uncorrelated) 
coarse particles corresponding to a homogeneous distribu- 
tion. So it will be useful to compare the small-rank voids 
with the ones of a random distribution of particles. 



1 voids and a new void-finder 

To generate random fractals we use either random Cantor- 
like algorithms or fractional Brownian motion (FBM) meth- 
ods [13] . A fractional Brownian motion or its generaliza- 
tion to more than one dependent variable are not self- 
similar but self-affine [1,13J. However, appropriate sec- 
tions are self-similar, so fractional Brownian functions con- 
stitute an adequate method to generate random fractals, 
especially adequate when a precise control of their fractal 
dimension is useful. We use the method of spectral syn- 
thesis. We start from an array of wave- numbers, with a 
given power-law power spectrum (with the appropriate 
Hurst exponent) and random phases, and such that the 
amplitudes corresponding to opposite wave-numbers are 
conjugate, to yield real data in real space. Then we per- 
form a fast Fourier transform to obtain these real-space 
data. These self-affine data are then subjected to the ap- 
propriate sections to construct the final self-similar fractal 
point set. 

In Fig. [5] we have plotted the number-radius function 
and the rank-ordering of voids for a FBM D = 1/2 frac- 
tal with 2884 points in d — 1, with transition to homo- 
geneity. This transition appears in the crossover of the 
number-radius function from N(r) oc r 1 / 2 to N(r) oc r 
on scales about fifteen times smaller than the total size. 
This crossover has its counterpart in the flatness of the 
rank-ordering of large voids (on similar scales). Note that 
the flattening is progressive, but the largest voids have 
approximately the same size. 

In two dimensions, we have run our void- finder on vari- 
ous fractal point sets with different dimensions and various 



Jose Gaite: Zipf's law for 




Fig. 3. Voids of a random Cantor fractal set with D — 1 and 
12288 points (corresponding to overlap parameter / = 0.3). 
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Fig. 4. (Top) number-radius function of the D = 1 fractal 
point set in d — 2 (Fig. |3}, and (bottom) Zipf's law for its 
voids (corresponding to the displayed /). 
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values of the overlap parameter /. As expected, Zipf's law 
is obtained; however, to get the maximum scaling range 
(that is, similar to the scaling range of the number func- 
tion), the value of / must be properly tuned. To a certain 
extent, this is due to the fact that one must keep away 
from the percolation threshold. Fig. [3] shows the voids 
found in a fractal with dimension D = 1 and 12288 points. 
Fig. 2] shows its number-radius function and the Zipf law 
for voids. We observe that the transition to homogeneity 
has a similar representation in both plots, namely, both 
graphs are approximately congruent. 

We have determined the fractal dimension from either 
the values of N(r) or the sizes of voids. From the number- 
radius function data we estimate D by a least-squares fit 
of the linear log-log relation. The result depends on the 
portion we choose to fit, that is, on the lower and upper 
cutoffs. The best fit of log N(R) in Fig. H (top) is found for 
the range 10-15, yielding D = 1.011 ± 0.004. The ends of 
this range correspond to r = 0.0055 and r — 0.031, that 
is, a scale factor of only (\/2) 5 ~ 5.7, O.but clearly the 
power law holds in a larger range: for example, 0. fitting 
the range 4-19 (scale factor 181) yields D = 1.035±0.004. 
We apply a similar method to the rank-ordering of voids 
0.(Fig.|H bottom), taking into account the discrete nature 
of O.the rank. The best fit is found for the range 512- 
8192 0.(2 9 -2 13 ), yielding 2/D = 1.986 ±0.010. The ends 
of O.this range correspond to A 1 / 2 = 0.0067 and A 1 ! 2 = 
0.0.00042, that is, a scale factor 16, larger than the best- 
fit 0. scale factor for the number-radius function. However, 
the largest 0. scaling ranges are similar in both cases. 
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Fig. 6. Transition to homogeneity of the Zipf's law for the 
voids of the D — 1 fractal point set in d — 2 (continuous 
grey line) compared with the voids of a random distribution of 
points having small-rank voids of the same size (dashed line). 
The overlap fraction used for both is / = 0.3 (as displayed). 
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Fig. 5. (Top) number-radius function of a D — 1.26 fractal 
set in d = 2 with 9216 points, and (bottom) Zipf's law for its 
voids (corresponding to the displayed /). 

For further illustration, we include another example in 
d = 2, namely, a fractal with dimension D = 1.26 and 
9216 points. Fig. [5] shows its number-radius function and 
the Zipf law for voids. The best fit of log N(R) in Fig. 
(top) is found for the range 7-13, yielding D = 1.27±0.02. 
We can also fit the slope of the rank-ordering of voids 
(taking into account the discrete nature of the rank) to 
give a quantitative measure of Zipf's law (Fig.[5j bottom). 
For example, a fit of the range 256-8192 (2 8 -2 13 ) yields 
2/D= 1.51 ±0.02. 

As explained above, the precise way in which the scal- 
ing region of a Zipf plot crosses over to homogeneity, namely, 
the way it flattens, can be simulated by considering the 
corresponding random distribution of coarse-grained par- 



ticles. Indeed, looking at the pattern of voids in Fig. [3j 
it is obvious that small voids between points in a cluster 
are approximately independent of voids between clusters. 
The largest coarse-grained particles correspond to clusters 
that are, essentially, distributed randomly. So the largest 
voids must follow the law of distribution of voids in a ran- 
dom point distribution. To check it, we have applied our 
void-finder to a random distribution of 57 points in the 
unit square, which happens to have smallest-rank voids of 
the same size as the smallest-rank voids of the fractal set 
in Fig. [3] Then we have superposed the respective log-log 
plots of rank-ordered voids in Fig. [5] (joining the points 
corresponding to individual voids). Both plots agree suf- 
ficiently well, within statistical errors, down to the rank 
where the scaling regime begins. There the random point 
distribution has essentially no more voids (except for a 
few small ones) and the corresponding line falls abruptly. 
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features like lacunarity) has value on its own but the exis- 
tence of various morphologies corresponding to the same 
scaling dimension poses difficulties for void-finders. Espe- 
cially, void-finders need to adapt to the particular shape 
of voids of the given fractal. Having tunable parameters in 
void-finder helps. Clearly, these parameters must only 
depend on relative magnitudes, like the one we have used 
Fig. 7. Log- linear plot of the decrease with rank of the Hill (/), but one has some liberty in their choice, nevertheless. 
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estimator for the Zipf law for the D = 1 fractal voids, showing 
convergence towards D/2 = 0.5. 
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The rank-ordering of voids is likely to be useful for the 
analysis of fractal distribution such that information on 
To derive the exponent of the Zipf law we can also their voids is readily available. Regarding the distribution 
apply maximum-likelihood estimation through Hill's esti- of galaxies, and considering the great amount of informa- 
mator [5] tion on its voids that is being compiled, we infer that it is 

y m -— , convenient to try to establish the Zipf law. Moreover, we 

i—l 

i n mi i; r i have demonstrated that the measure of the fractal dimen- 

takmg data down to rank H. Ine result for our example 

, „ , . n . is, m , „ . ,. . sion D provided by this law can reach similar accuracy to 

is plotted in .big. |7J ihe advantage of this estimator is 

, n • i -i the one given by the number-radius relation, which is the 

that it allows us to see how the small voids progressively 

. ., r , r „ n , standard method of measuring it. In addition, the rank- 

contribute to the measure of the value of D and where 

, n n . . i t i • i •, i ordering of voids also provides the scale of transition to 

the fluctuations due to discreteness begin to make it less ° 

homogeneity, and in a very intuitive manner, since it is 
precise. ° J 

realted with the size of the largest voids. This information 
is already available in the catalogues of galaxy voids. In 

5 Discussion 

contrast, the search for a scaling range in these catalogues 
We must stress that the rank-ordering of voids and the cor- fails [5] • Hopefully, the use of simple and well-defined void- 
responding Zipf's law convey no more information than finders, such as the one proposed here, and their use in the 
the number-radius function, as regards scaling. In fact, compilation of more complete catalogues of galaxy voids 
one may need to tune somewhat the void-finder parame- will lead to the observation of the scaling of voids and 
ter to extract the same information. This is probably due to independent measures of the fractal dimension and the 
to the fact that the morphology of voids depends on the scale of transition to homogeneity of the galaxy distribu- 
type of fractal. The information on morphology (including tion. 
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